A Hybrid Multi-group Privacy-Preserving Approach for Building Decision Trees
نویسندگان
چکیده
In this paper, we study the privacy-preserving decision tree building problem on vertically partitioned data. We made two contributions. First, we propose a novel hybrid approach, which takes advantage of the strength of the two existing approaches, randomization and the secure multi-party computation (SMC), to balance the accuracy and efficiency constraints. Compared to these two existing approaches, our proposed approach can achieve much better accuracy than randomization approach and much reduced computation cost than SMC approach. We also propose a multi-group scheme that makes it flexible for data miners to control the balance between data mining accuracy and privacy. We partition attributes into groups, and develop a scheme to conduct groupbased randomization to achieve better data mining accuracy. We have implemented and evaluated the proposed schemes for the ID3 decision tree algorithm.
منابع مشابه
A hybrid intuitionistic fuzzy multi-criteria group decision making approach for supplier selection
Due to the increasing competition of globalization, selection of the most appropriate supplier is one of the key factors for asupply chain management’s success. Due to conflicting evaluations and insufficient information about the criteria, Intuitionisticfuzzy sets (IFSs) considered as animpressive tool and utilized to specify the relative importance of the criteria. The aim of this paper is to...
متن کاملDecision Tree Classifier for Privacy Preservation
In recent year’s privacy preservation in data mining has become an important issue. A new class of data mining method called privacy preserving data mining algorithm has been developed. The aim of these algorithms is to protect the sensitive information in data while extracting knowledge from large amount of data. We focus the general classification in a secured manner and introduce a privacy-p...
متن کاملBuilding Privacy-preserving C4.5 Decision Tree Classifier on Multi- Parties
In this paper, we address Privacy-preserving classification problem in a multi-party sense. We focus the general classification in a secured manner and introduce a Privacy-preserving decision tree classifier using C4.5 algorithm without involving third party. C4.5 algorithm is a software extension of the basic ID3 algorithm designed by Quinlan. Our protocol is considerably more efficient than a...
متن کاملPrivacy Preserving Data Mining using Random Decision Tree
Data processing with information privacy and information utility has been emerged to manage distributed information expeditiously. In this paper, to deal with this advancement in privacy protective data processing technology victimization intensify approach of Random Decision Tree (RDT). Random Decision Tree provides higher potency and information privacy than Privacy secured Data mining Techni...
متن کاملA centralized privacy-preserving framework for online social networks
There are some critical privacy concerns in the current online social networks (OSNs). Users' information is disclosed to different entities that they were not supposed to access. Furthermore, the notion of friendship is inadequate in OSNs since the degree of social relationships between users dynamically changes over the time. Additionally, users may define similar privacy settings for their f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007